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Abstract 

Observational learning is an important information aggregation mechanism. However, it occa- 
sionally leads to a state in which an entire population chooses a sub-optimal option. When this 
occurs and whether it is a phase transition remain unanswered. To address these questions, we 
performed a voting experiment in which subjects answered a two-choice quiz sequentially with 
and without information about the prior subjects' choices. The subjects who could copy others 
are called herders. We obtained a microscopic rule regarding how herders copy others. Varying 
the ratio of herders led to qualitative changes in the macroscopic behavior of about 50 subjects 
in the experiment. If the ratio is small, the sequence of choices rapidly converges to the correct 
one. As the ratio approaches 100%, convergence becomes extremely slow and information aggre- 
gation almost terminates. A simulation study of a stochastic model for 10^ subjects based on the 
herder's microscopic rule showed a phase transition to the two-peaks phase, where the convergence 
completely terminates as the ratio exceeds some critical value. 

PACS numbers: 05.70.Fh,89.65.-s,89.65.G 
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I. INTRODUCTION 



The tendency to imitate others is one of the basic instinct of human. People effectively 
and inadvertently act as filters to provide the information that is most useful for an ob- 
server. Imitation and copying is a highly adaptive means of gaining knowledge jl| . It 
presumably results from an evolutionary adaptation that promoted survival over thousands 
of generations. It allows individuals to exploit the hard-won information of others 
However, imitating or copying others has disadvantages. The acquired information might 
be outdated or misleading {2, 3|. Copying wrong information might lead to herding, where 
an entire population makes a wrong decision. This is referred to as information cascade or 
rational herding j4-7|. Unfortunately, because imitation is a basic instinct and because it is 
economically rational to copy others, humans might not be able to evade such a catastrophic 
situation . Social influences have many forms, including imitation, conformity and obedi- 
ence jsj. Recent studies in cognitive neuroscience suggest that some type of imitation occurs 
automatically via the actions of "mirror neuron" systems jol. 

In the field of social psychology, many studies have focused on how humans use social 
information at the microscopic level {2], [lo| . In the field of finance and economics, it is now 
widely believed that investors are influenced by the decisions of others and that this influence 
is a first-order effect {4, 5|. We now have a number of interest ing mode ls of rational herding 



based on simple, straightforward, and convincing intuition 
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17l | . Empirical financial 



research has focused on macroscopic data primarily because such data is easily available. 
Micro- macro aspects of information cascade have been studied in 3|. It was concluded 
that the information cascade is fragile and self-correcting. Even if the population makes a 
wrong decision at a point in the choice sequence, it will eventually turn to a correct choice. 
The analysis was based on a stochastic model, and the asymptotic behavior of the empirical 
choice sequences was not studied in detail. In order to study the nature of information 
cascade and, furthermore the possibility of phase transition, it is necessary to connect the 
micro-macro aspects without depending on a model assumption as far as possible. However, 
thus far, no empirical work has directly connected the microscopic and macroscopic aspects 
of information cascade and herding. 

Two types of phase transitions have been predicted in a two-choice voting model depend- 
ing on the strength of conformity of people types of individuals: herders 



and independents. The voting of independents is based on their fundamental values, while 
the voting of herders is based on the number of previous votes. If the herders are analog 
herders and they vote for each choice with probabilities that are proportional to the^ choices' 
votes, there occurs a transition between the super and normal diffusion phases [19]. If the 
independents are the majority of voters, the voting rate converges at the same rate as in 
a binomial distribution, which is called the normal diffusion phase. As the proportion of 
herders is over 50%, the voting rate converges more slowwly than in a binomial distribution: 
this is called the super-diffusion phase. However, the presence of herders does not affect the 
accuracy of the majority's choice. If the independents vote for the correct choice rather than 
for the wrong one, the majority of voters always choose the correct choice. The probability 
distribution of the voting rate has only one peak, and these two phases are collectively re- 
ferred to as the one-peak phase. In the digital herder case, where herders always choose the 
choice with the majority of previous votes, the majority's choice does not necessarily coin- 
cide with the correct choice, even if the independents vote for the correct choice rather than 
for the wrong one. When the fraction of herders increases, there occurs a phase transition, 
beyond which a state where most voters choose the correct choice coexists with one where 
most of them choose the wrong one [2^. If the fraction of herders is below the threshold 
value, most voters choose the correct choice and the system is in the one-peak phase. If 
the fraction is above the threshold value, the distribution of the voting rate has two peaks 
corresponding to the two coexisting states: this phase is called the two-peaks phase. We call 
the phase transition between the one-peak and two-peaks phases the information cascade 
transition. 

In this paper, we have adopted an experimental and theoretical approach to the study of 
the phase transitions in information cascade . The organization of the paper is as follows. 
We explain the experimental design and procedure in section |Tll Section UTTl is devoted to the 
analysis of the experimental data. We show that varying the ratio of herders led to qualitative 
changes in the asymptotic behavior of the convergence of the voting rate. In section llVt 
we introduce a stochastic model which simulates the system. We obtain a microscopic rule 
regarding how herders copy others. We performed a simulation study of a stochastic model 
for 10^ subjects based on the herder's microscopic rule. The model showed the information 
cascade transition to the two-peaks phase, where the convergence completely terminates 
as the ratio exceeds some critical value. Section |V] is devoted to the conclusions. In the 



appendices, we explain the experimental setup in detail. 

II. EXPERIMENTAL DESIGN AND PROCEDURE 

TABLE L Experimental design. 
Experiment T {r} M Subject pool System 

2010A 31 {0,1,2,3,5,7,9,00} 100 Kitasato Univ. Face-to-Face 
2010B 31 {0, 1, 2, 3, 5, 7, 9, 00} 100 Kitasato Univ. Face-to-Face 
2011A 52 {0,1,5, 11,21, cx)} 120 Hokkaido Univ. Web 
2011B 52 {0, 1,5, 11, 21, (X)} 120 Hokkaido Univ. Web 

The experiments reported here were conducted at the Information Science Laboratory 
at Kitasato University in October 2010 and at the Group Experiment Laboratory of the 
Center for Experimental Research in Social Sciences at Hokkaido University between June 
2011 and July 2011. The subjects included students from the two universities. We call the 
former experiment EXP2010 and the latter EXP2011. In EXP2010 (EXP2011), the number 
of individuals T was 31 (52). We prepared two groups of subjects, group A and group B. 
In Total, 62 (104) subjects participated in EXP2010 (EXP2011). There were two sequences 
of subjects, and we denote the order of each subject by t G {1,2, ■ ■■ ,T}. The number 
of questions M in the two-choice quiz is 100 (120) in EXP2010 (EXP2011). Interaction 
between subjects in each group was permitted only through the social information given by 
the experimenter (Face-to-Face) in EXP2010 or by the experiment server (Web) in EXP2011. 
Table |T] summarizes the design. 

The subjects answered the quiz individually with and without information about the pre- 
vious subjects' choices. This information, called social information, is given as the summary 
statistics of the previous r subjects {Co(r, t), Ci(r, t)}. We denote the t + 1-th subject's 
answer for case r by A(r, t + 1), which takes the value 1 (0) if the choice is true (false). 
{Co(r, t), Ci(r, t)} are the numbers of subjects who choose each choice among the prior r 
subjects as Ci(r, t) = X]t'=t-r+i "^(^' ^^"^ C*o(r, t) = r — Ci(r, t). The choice of r is 
r G {r} = {0, 1,2, 3, 5, 7, 9, oo}({0, 1,5,11,21,00}) in EXP2010 (EXP2011). Here, r = 
means that the subjects receive no information and must answer with their knowledge only. 
In the case r = 00, the summary statistic is calculated from all previous subjects' choices. 
The subjects answered the quiz with their knowledge only (r = 0) initially. Next, they 
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answered with {Co{r,t),Ci{r,t)} from r = 1 to r = oo in increasing order of r in the set 
{r}. Any differences between the choices in r = and r > 1 can be attributed to the social 
information. 



First Answer 



You have already answered 59/120 questions, 
No. 30 Quiz 

Quiz : Which composer is famous for The Symphonie No. 6 "Pathetique"? 
Piease choose. 



Tchaikovsky 


Beethoven 







[ Answer j 



Answer r=l 



Quiz : Which composer is famous for The Symphonie Mo, 6 "Pathetique"? 
We show you the previous subject's choice. 
His choice is as foiiows. 
Piease select your choice. 



Tchaikovsky 


Beethoven 


1 









[ Answfir J 



All previous subjects' Info. 



No, 30 Quiz 

Quiz : Which composer is famous for The Symphonie No. 6 "Pathetique"? 
Up to now 31 subjects have answered. 
Their choices are as follows. Please choose. 



Tchaikovsky 


Beethoven 


9 


22 ^ 







[ Answer | 



FIG. 1. Snapshot of the screen for r = 0, 1, and oo in EXP2011. The summary statistics 
{Co(oo, t), Ci(oo, t)} are given in the second row in the box in cases r = 1 and oo. 

Fig. [1] shows the experience of the subjects in EXP2011 more concretely. The subjects 
entered the laboratory and sat in the partitioned spaces. After listening to a brief explanation 
about the experiment and the reward, they logged into the experiment web site using their 
IDs and started to answer the questions. A question was chosen by the experiment server and 
displayed on the monitor. First, subjects answered the question using their own knowledge 
only (r = 0). Later, subjects received social information and answered the same question. 
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Fig. [T] shows the cases r = 1 and r = oo. Subjects could then use or ignore the social 
information when making decisions. 



A. Experimental procedure 

We here explain in detail the procedure and the experience of the subjects in the ex- 
periment 21|. A subject answered a question with no public information (r = 0) initially. 
The answer was denoted as X(0,t) for the t-th subject in the subjects' sequence. If the 
subject was first {t = 1), he answered only in the case r = 0. X(0, 1) was copied to 
X{r, 1) as X{r, 1) = X(0, 1) for r > 1 for later convenience. If t > 1, the experimenter (in 
EXP2010) or the server (in EXP2011) calculated the social information {Co{r,t),Ci{r,t)} 
and gave it to the subject. If t = 2, the subject answered the question in case r = oo with 
{Co(oo, 1), Ci(oo, 1)} and the answer was denoted as X(oo,2). Here, Ci(oo,l) is X(0,1). 
By the convention X(r, 1) = X(0, 1) for r > 1, we can write Ci(oo, 1) = X(oo, 1). As 
^1(1,1) = ^1(00,1), we copy X(cx),2) to X(l,2) as X(l,2) = X{oo,2). As in t = 1, 
X(l,2) = X(cx),2) is copied to X(r,2) as X(r,2) = X(l,2) for r > 1. If t = 3, the 
subject answered the question in case r = 1 with {Co(l, 2), Ci(l, 2)}, and the answer was 
denoted as X(l,3). The social information {Co(l, 2), Ci(l, 2)} was calculated with the an- 
swer X(l,2) as Ci(l,2) = X(l,2) by the copy convention X(l,2) = X(oo,2). Then, the 
subject answered in case r = 00 and the answer was denoted as X(oo,3). Ci(oo,2) is 
Ci(oo, 2) = X(0, 1) + X(l, 2), which can be written as Ci(oo, 2) = X(oo, 1) + X(oo, 2). For 
r > 1, X(oo, 3) is copied to X(r, 3). By the copy convention, the social information in case 
r can be expressed with the answers {X(r, t)} in case r only. If t = 4, the subject answered 
the question in case r = 0, which is written as X(0,4), and then in case r = 1, written as 
X(l,4). Next, in EXP2011, the subject answered in case r = 00, written as X(oo,4). The 
social information is Ci(l, 3) = X(l, 3) and Ci(oo, 3) = X(oo, 1) +X(oo, 2) +X(oo, 3). Fig. 
[2] gives the pictorial explanation of the procedure. 

In general, if the order of the subject is t, there is no public information for r > t — 1. 
There is the maximum value Vmax in the set {r} that satisfies Vmax < t — 1. The subject 
answered from case r = to case Vmax in the set {r} in increasing order of r. Then, 
the subject was given the social information from all priors (r = 00) and answered in 
case r = 00. He did not answer cases r > Vmax in the set {r} and the answer in case 



{Co(r,t),Ci(r,i)} 




t+i 




1 



2 



3 



4 



© 

.X(0, 1) 



© © © 
.X(0, 2) X(0, 3) X(0, 4) 




^ © , © 

X(l,3)-^-X(l,4) 



FIG. 2. Pictorial explanation of experimental procedure. A subject answers the quiz questions in 
increasing order of r from r = to r = oo. The t + 1-th subject answers in case r with social 
information {Co{r,t),Ci{r,t)} and the answer is denoted as X(r,t + 1). The first subject (t = 1) 
answers only in the case r = 0, which is written as X{0, 1). The second subject (t = 2) answers 
in the cases r = and r = oo, which are written as X{0, 2) and X{oo, 2), respectively. The third 
subject answers in case r = 0, then in case r = 1, and finally in case r = oo; these are written 
as X(0, 3), X(l, 3), and X(oo,3), respectively. The number in the circle indicates the order of 
the answer for each subject. The number in the arrow indicates the memory length of the social 
information. For r = 1, it is one. For r = oo, the t-th subject receives t — 1 previous subjects' 
information, and the length is t — 1. 

r = oo was copied to the unanswered cases as X{r,t) = X{oo,t) for r > r^ax in set {r}. 
The answer in case r started from the r + 2-th subject in the sequence. For t < r + 2, 
X{r,t) = X{oo,t) by the copy convention, and {Co{r,t),Ci{r,t)} can be written using only 
{X{r,t)} as Ci(r, t) = X]t'=i-r+i "^(^' ^^'^ Co{r,t) = r — Ci{r,t). We use the same 
conventions when we prepare a sequence of choices in case r. All sequences of choices 
{X{r,t)} start from t = 1. The percentage of correct answers up to the t-th subject is 
defined as Z{r,t) = ^l>=iX{r,t')/t, and the final value is Z{r,T). 

B. Quiz selection 

We explain the choice of the questions in the quiz. In the experiment, it was necessary 
to control the difficulty of the questions. If a question is too easy, all subjects know the 
answer. If the question is too difficult, no subjects know the answer. In order to study 
social influence by varying the ratio of people who do not know the answer, it is necessary 
to choose moderately difficult questions. In EXP2010, we selected 100 questions for which 
only one among the five experimenters knew the answer. This choice means that the ratio 
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TABLE II. Five typical questions from the two-choice quiz in EXP2011. q is the label of the 
questions in the quiz and g G {0, 1, 2 • • • , 119}. 

q Question Choiceo Choicei Answer 

Which insect's wings flap more in one minute? mosquito honeybee 

1 During which period did the Tyrannosaurus Rex live? Jurassic Cretaceous 1 
3 Which animal has a horn at birth? rhinoceros giraffe 1 

7 Which is forbidden during TV programs in Korea? commercials kissing scenes 

8 Which instrument is in the same group as the marimba ? vibraphone xylophone 1 

is estimated to be around 80% for the subjects. After EXP2010, we calculated Z{0,T) for 
each question. In general, Z{0,T) > 0.5 for two-choice questions. A too-small value of 
Z{0,T) indicates some bias in the given choices of the question. We excluded questions with 
too-small values of Z{0,T) and prepared a new quiz with 120 questions in EXP2011. Table 
IIT] shows flve typical questions from EXP2011. 



III. DATA ANALYSIS 

In the analysis, the subjects are classifled into two categories, independent and herder, 
for each question. If a subject knows the answer to a question with 100% confldence and 
the answer is not affected by others' choices, he/she is categorized as independent. If the 
subject does not know the answer and if he/she may be affected by others' choices, he/she 



is categorized as herder [15|, ll9|, l20|. We assume that the probability of a correct choice 
for independent and herder subjects to be 100% and 50%, respectively. For a group with p 
herders and 1—p independent subjects, the expectation value of Z{0,T) is E{Z{0,T)) = 
1 —p/2. For each question in each group, we estimate phj p = 2(1 — Z{0, T)) as the maximal 
likelihood estimate. The assumption of the random guess (50%) by the herder might be too 
simple. As Z{0,T) approaches 0.5 and almost all subjects do not know the answer to the 
question, p approaches 100% and the estimate works well. 

Our experimental and analytical design has three advantages over both theoretical models 
and observational studies, (i) We control the amount of social information that the subjects 
receive by the change in r. This enables us to derive a microscopic rule for human decisions 
under social information 22|. (ii) Based on the answers in the absence of information 
(r = 0), we can estimate the ratio of herders p, which will enable us to extract the herder's 
decision rule from the results in (i). (iii) Our analysis focuses on the asymptotic behavior of 



the convergence of Z{r,t) with fixed herder's ratio p. We clearly see the collective behavior 
of humans and the qualitative change when p is varied. In par ticular, we can study the 
possibility of the information cascade transition [tI, 14, 19, 20, 23 1. 



We include a note about the controllability of p in the experiment. In the experiment, 
after all T subjects answered, we calculated Z{0,T) and estimated the herder's ratio p as 
p = 2(1 — Z{0, T)). It may seem impossible to control p in the experiment, but this is not so. 
We think p is an inherent property of two-choice questions. If we can estimate p for a large 
number of subjects T » 1, we can apply the same value to the experiments with other 
groups. We have compared the two values of Z{0, T) of group A and B for the same question 
in EXP2011. Pearson's correlation coefficient p is 0.82, and there is a strong correlation. 
The system size T in EXP2011 is very limited (T ^ 50) and there remains some fiuctuation 
in the estimation of p, but it will disappear for a large enough T. We can know p in advance 
and control it in voting experiments. 



A. Distribution of Z{r,T) 

There are 2 x 120(100) samples of sequences of choices for each r in EXP2011 (EXP2010). 
We divide these samples into 11 (9) bins according to the size of Z{r,T), as shown in Table 
IIIIR ( jlllB ). The samples in each bin share almost the same value of p. For example, in the 
sample in the No. 6 bin (0.45 < Z{0,T) < 0.55) in EXP2011, there are almost only herders 
in the subjects' sequence and p ^ 100%. On the other hand, in the sample in No. 11 bin 
{Z{0,T) > 0.95), almost all subjects know the answer to the question and are independent 
[p ^ 0%). An extremely small value of Z{0,T) indicates some bias in the question. We 
omit data with Z{0,T) < 0.45(13/31) in the analysis of the system and we are left with 180 
(166) samples in EXP2011 (EXP2010). The samples with Z(0,T) < 0.5 in No. 6 bin in 
EXP2011 (No. 5 bin in EXP2010) have p values larger than 100%. These values are errors 
of the estimation p = 2(1 — Z{0,T)). The standard deviation of Z{0,T) is p/2\/T for fixed 
p. In the estimation of p, there is a fiuctuation with the magnitude of order ~ p/\/T. If p 
takes value larger than 100%, we take it to be 100%. Table UTTl shows the number of data 
samples in each bin for r = 0,5 and r = oo as A^(0),A^(5), and A^(oo). Social information 
causes remarkable changes in the subjects' choices. For r = (r = 5), there is one peak 
at No. 7 (No. 9), and for r = oo, there are peaks at No. 3 and No. 11 in EXP2011. 
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TABLE III. Effect of social information on subjects' decisions in (A) EXP2011 and (B) EXP2010. 
We divide the samples according to the size of Z{r,T). N{r) denotes the number of samples in 
each bin. pavg is estimated as the average value of p = 2(1 — Z{0,T)) in each bin. In the last 
column, the sub-optimal ratios are shown. 



(A) EXP2011 



No. 


Z{r,T)[%] 


iV(0) 


Pavg[%] 


N{5) 


iV(oo) 


Ratio 


1 


< 5 





NA 





2 


NA 


z 


~ io 


U 


M A 
i\ A 


4 


1 Q 


M A 
i\ A 


3 


15 ~ 25 


8 


NA 


8 


22 


8/8 


4 


25 ~ 35 


16 


NA 


21 


20 


13/16 


5 


35 ~ 45 


36 


NA 


24 


8 


28/36 


6 


45 ~ 55 


43 


96.7 


26 


9 


16/43 


7 


55 ~ 65 


46 


79.3 


34 


10 


9/46 


8 


65 ~ 75 


45 


62.7 


40 


14 


2/45 


9 


75 ~ 85 


33 


41.9 


45 


33 


0/33 


10 


85 ~ 95 


11 


21.3 


32 


67 


0/11 


11 


> 95 


2 


2.0 


6 


37 


0/2 


Total 




240 




240 


240 


76/240 


(B) EXP2010 


No. 


Z(r,31) 


iV(0) 


Pavg[% 


Nib) 


N{oo) 


Ratio 


1 


< 1/31 





NA 





3 


NA 


2 


2/31 ~ 5/31 


2 


NA 


7 


20 


2/2 


3 


6/31 ~ 9/31 


6 


NA 


18 


22 


6/6 


4 


10/31 ~ 13/31 


26 


NA 


21 


14 


23/26 


5 


14/31 ~ 17/31 


52 


97.6 


33 


6 


26/52 


6 


18/31 ~ 21/31 


54 


74.6 


21 


7 


5/54 


7 


22/31 ~ 25/31 


33 


49.1 


35 


29 


1/33 


8 


26/31 ~ 29/31 


25 


26.3 


47 


67 


0/25 


9 


> 30/31 


2 


6.5 


18 


32 


0/2 


Total 




200 




200 


200 


63/200 



Here, we compare the densities, not the value N{r) itself. In the last column, we show the 
ratio of sub-optimal cases {Z{oo,T) < 1/2} with respect to the A^(0) samples in each bin. 
The crucial problem is whether the sub-optimal cases \Z(oo,T) < 1/2} remain so in the 

nn 

thermodynamic limit T — t- oo [7|, l20| . 

In order to see the social influence more pictorially, we show the scatter plots of Z{0, T) 
vs Z{r,T) for all 240 samples in EXP2011 for each r G {1{A),5{B),11{C),21{D), oo{E)} 
in Fig. [3l The x-axis shows Z{0,T) and the y-axis shows Z{r,T). The vertical lines show 
the boundary between the bins (from No. 1 to No. 11) in Table UTTl A. As we move from 
Fig. [3JA. (r = 1) to Fig. |3l5 (r = oo), the amount of social information r increases. If the 
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FIG. 3. Scatter plots of Z{0,T) vs Z{r,T) in EXP2011 for (A) r = 1, (B) r = 5, (C) r 
r = 21 and (E) r = oo. The vertical lines show the border of the bins in Table HIIl A. 



11, (D) 



subjects' answers are not affected by the social information, the data should distribute on 
the diagonal line. However, as the plots clearly indicate, this is not the case. As r increases 
from r = 1 to r = oo, the changes Z{r,T) — Z{0,T) increase and the samples scatter more 
widely in the plane. For the samples with Z{0, T) > 0.75 (Nos. 9, 10, 11 bins in Table UTTK ) . 
the changes are almost positive and Z{oo,T) takes a value of about one. The sub-optimal 
ratios are zero in the bins. The average performance improves by the social information 
there. On the other hand, for the samples with 0.45 < Z{0, T) < 0.55 (No. 6 bin in Table 
IIIIR ). the social information does not necessarily improve the average performance. There 
are many samples with negative change Z{oo,T) — Z{0,T) < 0. These samples are in the 
sub-optimal state and constitute the lower peak in Table IIIII A. 



B. Order parameters of the phase transition 

We have seen drastic changes in the distribution of Z{oo,T) from the distribution of 
Z{0,T). Table Uni shows the two-peaks structure in the distribution of Z{oo,T). In Fig. 
[31 we see an S-shaped curve in the case r = oo. The natural question is whether these 
macroscopic changes can be attributed to the information cascade transition. In our previous 



work on the voting model with digital herders 

12 



20| , we showed the possibility of the phase 



transition from the one-peak phase to the two-peaks phase. If p is smaller than some 
critical value pc, the system is in the one-peak phase. The distribution of Z(oo,T) has 
only one peak at Z(oo,T) > 1/2. If p > pc-, the distribution of Z(oo,T) has two peaks at 
Z(oo,T) < 1/2 and Z(oo,T) > 1/2. In the thermodynamic limit T — )■ oo, the probability 
that Z(oo,T) < 1/2 becomes a function of which is non-analytic a.t p = Pc and takes a 
positive value for p > pc. There are two candidates for the order parameter of the phase 
transition. One is the sub-optimal ratio Z(oo,T) < 1/2. The other is the variance of 
Z(oo,T), Var(Z(oo, T)). Both candidates are zero for p < Pc and positive for p > pc in the 
thermodynamic limit T — )■ oo. 
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FIG. 4. (A) The ratios of the sub-optimal (Z(oo,T) < 1/2) cases, which are given in the last 
column of Table Hill and (B) the variance Var(Z(oo, T)), are plotted vs p. The symbols o(A) 
indicate the results of EXP2011 (EXP2010). 



Fig. m shows the plot of the two order parameters vs p. We plot (A) the ratios of the 
sub-optimal (Z(oo,T) < 1/2) cases and (B) Var(Z(oo, T)). The ratios are given in the last 
columns of Table lllli As p increases, the order parameters change from zero to some finite 
value. They are monotonically increasing functions of p. However, in the behaviors, we 
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cannot see any clear evidence of the phase transition. The system size is very small and we 
cannot see any non-analytic nature there. We cannot use them to prove the existence of the 
information cascade transition. 



C. Asymptotic behavior of the convergence of Z{oo,t) 

We study the convergence of Z{oo,t) in the limit t — )■ oo to clarify the possibility of the 
information cascade transition. If information aggregation works under social information, 
Z{oo,t) converges to some value larger than half. The distribution of Z{r,t) has only one 
peak: it is in the one-peak phase. Depending on the convergence behavior, the one-peak 
phase is classified into two phases. If the variance of Z{r, t) shows normal diffusive behavior 
as Var((Z(r, t)) oc t~^, it is called the normal diffusion phase. We note that the variance 
is estimated for the ratio, and the usual behavior for the sum of t random variables is 
replaced by oc t/t"^ = t^^. If the convergence is slow and it obeys Vai{Z{r,t)) oc t^"' with 



< 7 < 1, it is called the super diffusion phase 1^,12^. If information aggregation does not 
work and there is a finite probability that Z(r, t) converges to some value less than half, the 

nn 

distribution of Z{r,t) has two peaks [7|, 120]. It is in the two-peaks phase, and Var(Z(r, t)) 
converges to some finite value in the limit t — ?■ oo. 

Fig. [5JA. (B) shows the double logarithmic plots of Var(Z'(oo, t)) as a function of t for 
EXP2011 (EXP2010). If the plot of Var(Z(oo,t)) vs t has a negative slope (7 > 0) in the 
limit t — 7- 00, the system is in the one-peak phase. If the slope is zero (7 < 0) in the 
limit t — i- 00, the system is in the two-peaks phase. We see that the convergence becomes 
very slow as p increases. The exponent 7 is estimated by fitting with oc t~'^ for t > 20. It 
decreases from 1.43 (1.53) to 0.09 with the increase in p in EXP2011 (EXP2010). For the 
cases with p = 96.7%(97.6%)in EXP2011 (EXP2010), the system can be in the two-peaks 
phase. 



IV. STOCHASTIC MODEL AND SIMULATION STUDY 

The asymptotic analysis of the convergence of Z{oo,t) shows the possibility of the two- 
peaks phase in the cases p = 96.7% and 97.6%. The negative slope 7 is remarkably small 
(~ 0.09) in both experiments. However, the system sizes are limited and far from the 
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FIG. 5. Convergent behavior for (A) EXP2011 and (B) EXP2010. The convergence is given by the 
double logarithmic plot of Var(Z(oo,t)) vs t using the samples in the four bins for r = in Table 
iniK (No.6(o),7(A),8(o) and 9(x)) and in Table Im^ (No.5(o),6(A),7(o) and 8(x)). The dotted 
lines are fitted results with oc t~'^ for t > 20. 



thermodynamic limit T — )► oo. In this section, we derive a microscopic rule as to how the 
herders copy others' information. Based on the herder's microscopic rule, we introduce an 
ad hoc stochastic model. A simulation study of the model for 10^ subjects showed the 
information cascade transition. 



A. Microscopic behavior of herders 

We determine how a herder's decision depends on social information. For this purpose, 
we need to subtract the independent subjects' contribution from X{r,t+1). The probability 
of being independent is 1 — p and such a subject always chooses 1 ; the herder's decision is 
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then simply estimated as 
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(X(r,t + l)-(l-p)-l)/p. 

The expectation value of this under Ci(r, t) = rii indicates the probability that a herder 
chooses an option under the influence of the prior rii subjects among r choosing it. We 
denote it by qh{r,ni), and it is defined for t > r as 



g,(r, m) = E((X(r, t + 1) - (1 - p))/p\C,{r, t) = m). (1) 

The conditional expectation value in Eq.([T]) is estimated using the samples that satisfy 
Ci(r, t) = rii from EXP2011. From the symmetry between 1 ^ 0, we assume qh{r,ni) = 
1 — qh{r,r — rii). For r = oo, the t + 1-th subject obtains information from the previous t 
subjects and r is considered to be t. r = oo case is averaging several values of r. For r = 21 
and oo, we study the dependence of qh{r, rii) on rii/r and round rii/r to the nearest values in 
{k/ll\k G {0, 1, 2, ■ ■ ■ , 11}}. In addition, we estimate qh{oo, rii) for t > 22 to understand the 
herder's decision under the largest amount of social information. The second reason is that 
the subjects receives much information for t > 22 and we can assume that the dependence 
of qh{oo, Til) on {t, nl) is replaced with the dependence on nl/t. 



0.95 
0.9 

0.85 
_ 0.8 
cT 0.75 
^ 0.7 

IT 

0.65 

0.6 
0.55 

0.5 

0.5 0.6 0.7 0.8 0.9 1 




FIG. 6. Microscopic rule of herder's decision in EXP2011. It shows the probability qh{f,Tii) that 
a herder chooses an option under the influence of the prior ni subjects among r choosing it for 
cases r = l(x), 5(*), ll(o) and 21(A). For r = oo(o), r is considered to be t. The thin sohd curves 
are fitted results with Eq.(I5]) for r = 1,5,11, and 21 from the bottom. The top thick solid line 
corresponds to r = oo. 
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Figure [6] shows the plot of qhir, ni) vs rii/r. It is clear that qh{r, rii) is an almost monotonic 
increasing function of rii. As r increases, it shows stronger dependence on rii/r, and the 
herder's decision is affected more greatly by the prior subjects' choices. We fit the plot by 
the following functional form: 

gh(r,ni) = i (atanh(A,(ni/r - 1/2)) + 1) . (2) 

The parameters a and A,, indicate the strength of the conformity of the subjects. Social 
ogy studies suggest that people's likelihood to use social information depends on their 



psycho 

mood 10|. a denotes the net ratio of herders who react positively to the priors subjects' 
choices. A,, denotes the strength of the dependence on social information. By the least 
squares fit, we obtain a = 0.761 and Aoo = 3.80 for r = oo. The fitted result is also shown 
in Fig. 61 The values of a and Aoo depend on the experimental situation and on the system 
size T 26|. Using the same a, we fit the data for other r using Eq.([2]). The results are also 
given in Fig. [6l As the amount r of social information increases, the strength of dependence 
Xr increases. 



B. Information cascade transition of voting model 

To understand the behavior of the system in the thermodynamic limit T — )■ oo, we 
simulate the system for large T by a stochastic model based on Eq.([2]). We introduce 
a stochastic process {X(t)},t G {1,2,3, ■ ■ ■ ,T}. X(t + 1) G {0, 1} is a Bernoulli random 
variable and its probabilistic rule depends on all the previous {X{t')}, t' G {1, ■ ■ ■ ,t} through 
Ci{t) = Ylt'=i-^i^')- '^^^ probability that X{t + 1) is 1 for Ci{t) = rii, which is denoted as 
q{t, rii), is given as 

q{t, ni) = {l-p)+p-^{a tanh Xt{ni/t - 1/2) + 1) . (3) 

We set a as a = 0.761 and A^ for t G {1,5, 11,21} as A^ = A^. Here, A^ takes the values 
given in Fig. |6l For other values of t and t < 21, we use the linearly extrapolated value. For 
t > 21, we set At = Aoo- 

We denote the probability function Pr(^t'=i ^('^0 ~ ^) P{t,n). The master equation 
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FIG. 7. Plots of order parameters vs p for the voting model and the limit T — )■ oo. We plot (A) the 
sub-optimal ratio Z(oo,T) < 1/2 and (B) Var(Z(oo,T)) vs p; here, T is the length of the sequence. 
The symbol (o) indicates the experimental data for EXP2011(r = oo) (T ~ 50) for the five bins 
No. 6, 7, 8, 9, and 10 in Table Hill The lines show the data from the stochastic model for r = oo. 
For the stochastic model, we set T = 50 (thick solid hne), 10^ (thin solid), and 10^ (thick dotted). 
As T increases, the non-zero regions of the order parameters move rightward. In the limit T — ?• oo, 
the region reduces to p > pc = 93. 4%. The vertical chain line at p = pc shows the critical point of 
the phase transition between the one-peak phase (p < pc) and the two-peaks phase {p > pc)- 



for P(t, n) is 

P{t + 1, n) = g(t, n - 1) ■ P(t, n - 1) + (1 - g(t, n)) ■ P{t, n). (4) 

We solved the master equation recursively and obtain P{t,n) for t < 10^. 

Fig. [7] shows the results of the model. We plot (A) the sub-optimal ratio Z{oo, T) < 1/2 
and (B) Var(Z(oo, T)) vs p. For comparison, we plot the experimental results (EXP2011) 
using the symbol o. The model with T = 50 well describes the experimental results quan- 
titatively. In the figure, we also plot the results for T = 10'^ and 10®. As T increases, the 
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non-zero regions of the order parameters move rightward. In particular, we see that there 
is a crossing point in the curves of Var(Z(oo, T)) at p = 93.4%, which we denote as pc- If 
p < Pc, Var(Z(oo, T)) goes to zero as T increases. On the other hand, if p > Pc, the variance 
seems to remain in the hmit T — t- oo. This shows the phase transition between the one-peak 
and the two-peaks phases. For p > p^, the system is in the two-peaks phase (7 = 0). For 
p < Pc, the system is in the one-peak phase (0 < 7 < 1). In order to see the convergence 
rate (the exponent 7), it is necessary to study the asymptotic behavior of Var(Z(oo,t)) for 
large values of t. 
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FIG. 8. Thermodynamic limit and phase diagram. Asymptotic behavior of the convergence is 
given by the plot of 7 vs p. The symbols (o) show the 7s vs pavg in Fig. [5]A.. The lines show the 
results of the stochastic model with the system size T = 50 (thick solid), 10'^ (dotted), and 10^ 
(thick chain). 



We estimate 7 from the slope of Var(Z(t)) as 

1 yar(Z(T-AT)) T 
^ = Var(Z(T)) /^"^T^AT 

for the time horizons T = 50, 10^ and 10^. For T = 50, we take AT = 30 to match the 
analysis of the experimental data in EXP2011. For T = 10^ and 10^, we take AT = 10^. The 
results are summarized in Fig. |H1 For T = 50, 7 is a monotonic decreasing function of p and 
it describes the experimental results of EXP2011 well. For the limit T — )■ 00, we compare 
the results with T = 50, 10^, and 10^. 7 shows non-monotonic behavior as a function of p 
for the latter two cases; it is an artifact of finite T. In the limit T — )■ 00, 7 monotonically 
decreases from 1 to 0, and the threshold value is Pc = 93.4% '2^. For p < Pc {p > Pc), the 
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system is in the one-peak (two-peaks) phase. 



V. CONCLUSIONS 

The instinct to imitate others led to the remarkably slow convergence of information 
aggregation as the herder's ratio p approached 100%. A stochastic model based on the 
herder's microscopic copying rule predicted the information cascade transition between the 
one-peak {p < pc) and the two-peaks phases {p > pc) j^]- In the one-peak phase, information 
aggregation works and the majority's choice is always correct when the independents choose 
the correct choice rather than the incorrect one. In the two-peaks phase, the majority's 
choice is not necessarily correct. The coexistence of the optimal (Z(cxd,T) > 1/2) and the 
sub-optimal (Z(oo,T) < 1/2) states occurs there. 

It has been thought that information cascade was fragile js], 0| or self- correcting where 
the sub-optimal state disappears and switches to the optimal state jl8|. Our study indi- 
cates that the system is in the two-peaks phase and that the sub-optimal state is stable 
against small perturbations for p > pc the subjects are given the summary statistics 
{Co(oo, t), Ci(oo, t)}. The conclusion might appear to be contradictory to the previous one, 
but this is not so. In the previous works, each subject has his own information and it is not 



necessary to follow the majority if one can trust his own information |18|. In addition, the 
social information is the time series of the previous choices {X(oo, t')}i<t'<f, and it contains 
much more information than the summary statistics {Co(oo, t), Ci(oo, t)}. In our experi- 
ment, the herder does not have information and it is necessary to follow the majority if he 
wants to choose a correct answer. However, the system size in our experiment is very limited 
and it is difficult to infer the state of the system in the thermodynamic limit based only 
on experimental data. Our conclusion that there occurs a information cascade transition 
relies heavily on the results of the simulation study of the stochastic model. In addition, 
the experiments were performed with students at universities, and the scope of the subjects 
is thus very restricted. The robustness of the conclusion should be established by further 
experiments. For this purpose, a web-based experiment in artificial laboratories is promising 



28| . There, we can approach the thermodynamic limit T — )■ oo more easily than in physical 



laboratories and study the micro-macro feature of information cascade. 
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Appendix A: Experimental Setup 
1. EXP2010 

In EXP2010, the 62 subjects who participated in the experiment were recruited from the 
School of Science of Kitasato University. The subjects were randomly assigned to either 
group A or group B; each group had 31 subjects. In each session, one subject from each 
group entered the laboratory, a total of two subjects. We explained that we were studying 
how their choices were affected by the choices of others. After explaining the details of the 
experimental procedure and payment, each subject sat in front of an experimenter and had 
no contact with the other subject in the laboratory. The experiments on groups A and B 
were performed independently. Interaction between subjects in each group was permitted 
only through the social information given by the experimenter in front of each subject. 
The subjects were asked to answer the 100 questions in the two-choice quiz. The subjects 
answered the quiz with at most eight social influence conditions, including the case r = 0. 
Each session lasted about one hour. In order to obtain data from all T = 31 subjects in both 
groups, we performed the session 11 times on October 9 and and 10 times on October 16 
and 23 in 2010. Subjects were paid upon being released from the session. There was a 2000 
yen (about $24) participation fee and an additional 1000-yen reward (about $12) for the 
top ten subjects. The ranking of the subjects was calculated based on the ratio of correct 
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answers to all questions and r. 



2. EXP2011 

In EXP2011, the 104 subjects who participated were recruited from the Literature De- 
partment of Hokkaido University. The subjects were randomly assigned to either group A 
or group B; each group had 52 subjects. In each session, between one and six subjects 
from each group entered the laboratory, for a total of between two and eleven subjects. 
The experiment was performed in the Group Experiment Laboratory of the Center for Ex- 
perimental Research in Social Sciences of Hokkaido University. There were fifteen desks 
furnished with partitions and PCs. Each subject sat behind a partition and communication 
among them was prohibited. The experiments on groups A and B were performed inde- 
pendently. In order to study the effect of social information on the choices of the subjects, 
it was necessary to control the transmission of information from others in the same group. 
We developed a web-based voting system by which multiple subjects could simultaneously 
participate in the experiment. The subjects used a web browser to access the web voting 
server in the intranet. They could obtain information about the others' choices from the 
summary statistics {Co{r,t),Ci{r,t)} shown on the monitor. Subjects did not know which 
questions the other subjects were answering or what their choices were. 

With slides, we showed subjects how the experiment would proceed. We explained that 
we were studying how their choices were affected by the choices of others. In particular, 
we emphasized that the social information was realistic information calculated from the 
choices of previous subjects. The reason for this is that in contrast to EXP2010, the social 
information was given on the monitor and seemed less credible to the subjects. Through the 
slides, we also explained the payment. After the explanation, the experiment started. The 
subjects answered the 120-question quiz with at most six social influence conditions within 
about one hour. A total of 15 sessions were held on June 15 and 16 and from July 12 and 
13 in 2011. Subjects were paid in cash upon being released from the session. There was a 
600 yen (about $7) participation fee and additional rewards that were proportional to the 
number of correct answers. One correct choice was worth one point, and this was worth one 
yen (about one and a third cents). 

Some subjects could not answer all the questions within the alloted time, so the number 
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FIG. 9. Microscopic rule of herders' decisions in EXP2010. It shows the probabihty qh{r,ni) that a 
herder chooses an option under the influence of the prior rii subjects among r choosing it for cases 
r = l(x ), 3(*), 5(o) and 9(A). For r = oo(o), the t + 1-th subject's decision depends on previous 
t subjects' decisions and r is considered to be t. We estimate the dependences of (7/1(00, ni) on 
ni/t for t > 10, where ni/t is rounded to the nearest value in {k/9\k £ {0, 1, 2, • • • , 9}}. The solid 
curves are fitted results with qh{r,ni) = (a tanh(Ar-(ni/r — 1/2)) + 1)/2. For r = 00 (the thick solid 
line), we set r = t. From bottom to top, the thin solid lines correspond to r = 1, 3, 5, and 9. 

T of subjects who answered a question varied. The distribution of T is #{T = 52} = 
143, #{T = 51} = 5, #{T = 50} = 2, #{T = 49} = 85, #{r = 48} = 4, #{T = 47} = 1. 
The average value of T was 50.8 and the standard deviation ax was 1.48. The average length 
of the sequence was about 50. 

The assignment of quiz questions in EXP2011 was as follows. Each question has a label 
g in g G {0, 1,2, ■ ■ ■ , 119} and each subject has an ID number i. If the ID number i of 
the subject was odd, the subject started with the question with the smallest q in the pool 
not chosen by the server for another subject in the same group. If i was even, the subject 
started with the question with the largest q not chosen by the server for another subject. If 
another subject was answering a question, that question was never chosen by the server. It 
was strictly prohibited for multiple subjects to answer the same question simultaneously. 



Appendix B: Microscopic behavior of herders in EXP2010 

We determine how a herder's decision depends on social information in EXP2010. We 
follow the same procedure written in the main text. Fig. [9] plots the results. The fitted 
results with Eq.([2]) are also shown. Compared with Fig. [6l the convergence to Eq.([2]) is not 
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so good. The system size T = 31 is not enough to derive the microscopic rule. 



Appendix C: Estimate of 7 and its error bar 

In our study, we focus on the asymptotic behavior of Var(Z(oo, t)). In particular, we are 
interested in the power law behavior as Var(Z(oo, t)) oc t~'^. The negative slope —7 of the 
double logarithmic plot of Var(Z(oo, t) vs t gives the exponent 7 in the limit t — )■ 00. In our 
analysis, we estimate 7 by the least squares fit with the functional form a ■ t~'^ in the range 
20 < t < T. We denote the estimate as '~iexp- 

For the error bar of the exponent 7exp, we adopted the voting model to simulate the 
system and apply the parametric bootstrapping method based on it. First, we solved the 
model recursively up to T = 50 — 30 and T = 50 and obtained the probability functions 
P{T,n) for both time horizons T. We defined Z{T) as Z{T) = ^ and estimated 7 by the 
relation 

1 Var(Z(T-AT)) T 
^ = Var(Z(T)) /^"^T^AT ^^'^ 

using the probability function P{T, n). Here, 7 was estimated in the range T — AT < t < T. 
For T = 50, we take AT = 30 to match the analysis of the experimental data in EXP2011. 
The estimate is exact, and we denote it as jexact- In EXP2011, the number of samples in 
each bin ranges from 33 to 46, and is very limited. We studied the stochastic model using 
the Monte Carlo method with the same sample size as the experimental data in each bin and 
estimated 7 for the samples. We repeated this 10^ times to obtain the samples of 7. Using 
the samples of 7, we estimated the 95% confidence interval, which is denoted as [7., 7+]. 
The approximately estimated 7 was distributed around •y exact- The upper (lower) deviation 
was calculated as A7+ = 7_,_ — 'jexact (^^7- = lexact — 7-)- We estimated the 95% confidence 
interval of •jexp as [jexp — A'~^-^lexp + ^7+]- Using this procedure, we estimated the error 
bars for each 7 in Fig. [81 



25 



1 

0.9 
0.8 
0.7 
0.6 
0.5 
0.4 
0.3 
0.2 
0.1 



A vs B,p=0.82 



tt 



+++ ++ t 



+ ++ +t + + _ 



1.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 

Z(0,T) 



